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Abstract 

This paper addresses the problem of estimating the extreme value 
index in presence of random censoring for distributions in the 
Weibull domain of attraction. The methodologies introduced in 
Worms (2014)], in the heavy-tailed case, are adapted here to the 
negative extreme value index framework, leading to the definition of 
weighted versions of the popular moments of relative excesses with 
arbitrary exponent . This leads to the definition of two families of 
estimators (with an adaptation of the so called Moment estimator 
as a particular case), for which the consistency is proved under a 
first order condition. Illustration of their performance, issued from 
an extensive simulation study, are provided. 
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1 Introduction 


Extreme value statistics is an active domain of research, with numerous fields of ap¬ 
plication, and which benefits from an important litterature in the context of i.i.d. 
data, dependent data, and (more recently) multivariate or spatial data. By contrast, 
methodological articles in the case of randomly censored data are quite recent and 
few : |Einmahl et al. (2008) | presents a general method for adapting estimators of 
the extreme value index in a censorship framework (a methodology based on a previ¬ 
ous work jBeirlant et al. (2007)| ), |Diop et al. (2014)] extends the framework to data 
with covariate information, and | Worms (2014)] proposes a more survival analysis- 
oriented approach restricted to the heavy tail case. Other existing works on the 
topic of extremes for censored data are |Brahimi et al. (2013)] and the review paper 
Gomes and Neves ( 2011 )]. 


In this paper, the topic of extreme value statistics for randomly censored data 
with negative extreme value index is addressed. Our initial purpose was to rely on 
the ideas of |Worms (2014) in order to define a more ’’natural” version (with respect 
to that proposed in |Einmahl et al. (2008)] ) of the moment estimator in the context 
of censored observations. We finally came out to propose weighted versions of the 
popular moments of the relative excesses (with arbitrary exponent), and therefore 
define competitive estimators of the extreme value index in this censoring situation, 
for distributions in the Weibull maximum domain of attraction. 

Let us first define more precisely the framework, the data, and the notations. 


In the classical univariate framework of i.i.d. data, a central task is to estimate 
the extreme value index 7 , which captures the main information about the behavior 
of the tail distribution of the data. More precisely, a distribution function (d.f.) F is 
said to be in the maximum domain of attraction of H 7 (noted F e D(H^)) with 

jj , v _ f exp (—(1 + 7 a;) -1 / 7 ) for 7 0 and 1 + 72 ; > 0 

7 X ' | exp(— exp(— x)) for 7 = 0 and iel, 

if there exist two normalizing sequences (a n ) cz R + and (b n ) a R such that 

F n (a n x + b n ) ™ h^x) (VxeR). 

We consider in this paper two independent i.i.d. non-negative samples (Xj)j^ n and 
(Ci)isSn with respective continuous distribution functions F and G (with end-points 
t f and tg, where t f := sup{x, F(x) < 1}). In the context of randomly right-censored 
observations, one only observes, for 1 < z < n, 

Zi = Xi a Ci and 8 t = Ix^cy 

We denote by H the distribution function of the Z-sample, satisfying 

1 -H = (1 — F)(l — G) 

and by Z^ n ^ ^ Z n>n the associated order statistics. I 11 the whole paper, 

Si tn , ..., S n>n denote the S’s corresponding to Z l n ,..., Z n<n , respectively. F and G 
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are assumed to belong to the maximum domains of attraction D(H lx ) and D(H lc ) 
respectively, where yx and i G are real numbers, which implies that H e D(H^), for 
some y eK. 


Our goal is to estimate the extreme value index yx in this context of right cen¬ 
sorship. The most interesting cases, described in |Einmahl et al. (2008)], are the 
following : 


case 1: 

lx > 0 , ic > 0 

case 2: 

lx < 0 , ic < 0 , t f = t g 

case 3: 

lx = ic = 0 , t f = t g = +oo 


in this case y 

in this case y 
in this case y 


yxy c 

7.Y + 7c 
7x7 c 

7.Y + 7c 

0 . 


In |Worms (2014) |, case 1 above was considered and an adaptation of the so-called 


Hill estimator to the right censoring framework was proposed. In this paper, our aim 
is to consider case 2 above and adapt the approach leading to the so-called Moment 
Estimator to this censored situation. An adaptation of this estimator was already 
proposed in |Einmahl et al. (2008)] : it consists in dividing the classical Moment 
Estimator y^ of y (calculated from the Z-sample) by the proportion 


P : = K 


i+ 1 , n 


of uncensored data in the tail, where k n is the number of upper order statistics 
retained. Note that y)f is an appropriate combination of the following moments 


M 



1 

u 


kn 


J n— 2+1,n 


J n—k n ,n 




for a = 1 or 2 (where log“(x) stands for (log(x))"), and that p estimates the ultimate 
proportion p of uncensored observations in the tail, which turns out to be equal to 


lx ix + y c 


Our goal is to show that relying on usual strategies in the survival analysis lit¬ 
erature leads to estimators of yx which are often sharper than those obtained by 
simply dividing an estimator of y by the proportion of uncensored observations. By 
“usual” strategy we mean using “Kaplan-Meier”-like random weights : we refer to 
Worms (2014)| for more detailed informations concerning the origin of the two kinds 
of random weights appearing in the formulas below. As a matter of fact, we define, 
for any given a ^ 1, the following two versions of randomly weighted moments of the 
log relative excesses : 


Mi 


(a) 


n.kn 


kn 

s 


Sn- 


n— 2+1,n 


n (l F n (Z n -k ni n)) i= l 1 G n (Z n _ i+1 ) 


log c 


^n—2 +l,n 
7 

Zj n—k n ,n 


(i) 
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and 


1 


M. 


(«) . . 


n,k n 


_I_ y _ 1 

n (l — F n (Z n -k„,n )) i=i 1 — G n {Z n _ i+l n ) 




where 


fi,„ := i ( log” ( | - log" 


7 

^n—i.n 


7 ) \ 7 

^n—knjn / \ ^n—knyTi 

and (fc n ) is a sequence of integers satisfying, as n tends to +oo, 

k n —> +oo and = o(n). 


( 2 ) 

(3) 

(4) 


Above, F n and G n naturally denote the Kaplan-Meier estimators of F and G, respec¬ 
tively, defined as follows : for t < Z n n , 


i-p n (t)= n 




n — i 
n — i + 1 


and 1 — G n (t) = ]~[ 


Zi, n *a 


71 — 1 

n — i + 1 


1 — 5 ,- 


It should be noted that these 2 weighted versions of the moments of the log- 
excesses defined in (1) and (2) are in fact closely related : as a matter of fact, they 
differ only when the maximum observation Z n;n is censored (when 5 n , n = 1, we have 

5). However, both versions deserve 


indeed = M^ n , see Proposition 


attention : firstly because in practice t 


in Section 

ic last observation is often a censored one, and 
secondly because when they do differ, the difference is the only term involving the 
information contained in the maximum observation Z nn (this difference is therefore 
non-asymptotically not negligible, although it tends to 0 in probability, as stated in 
Proposition [I] in Section [5]) . 

In section [2] below, assumptions are presented and discussed, convergence results 
for the weighted moments and are stated, and we describe how classes 

of estimators of jx can be deduced by combining these moments for different values 
of a. In Section [3j performance of these estimators will be presented on the basis of 
simulations. Section [4] provides some words of conclusion, Section [5] is devoted to the 
proof of Theorem 1 below, and finally the Appendix includes standard (but central to 
our proofs) results on regularly varying functions, as well as the proofs of the different 
lemmas which were used in Section [5] 


2 Results 

2.1 Assumptions 

In addition to Q, our results need the following minimal assumption : 

(A) F e D(H lx ), G e D{H lc ) with < 0 , 7 c < 0 and x* := tf = tq- 

As noted earlier, this assumption implies that FI e D^H^) with = x* and 

7x7 c ^ n 
7 = -< 0. 

7 x + 7 c 
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If we note U(t) = H*~( 1 — 1 /t) the quantile function associated to H , then x* = U (go) 
and H e D(H^) is equivalent to the existence of some positive function a such that 


log U(tx) — log U(t) 

hm - 7 , —— . - 

i->+co a(t)/U(t ) 

which, since 7 < 0, is itself equivalent to 

U (go) - U(tx) 
t-Too U (go) - U{t) 


nrl — 1 

-, Vx > 0, 

7 


= x 7 , Vx > 0. 


(5) 

( 6 ) 


This means that the function [/(go) — U is regularly varying (at +oo) with index 7 
(see the appendix for the definition of regular variation at +00). A reference for the 
equivalence of conditions (J5| and (j6]) to (A) is |Haan and Ferreira (2006) | (respectively 
relation (3.5.4) and Corollary 1.2.10 there). 

Finally, we will need some very mild additional assumption on (k n ) 


(K) there exists some 8 > 0, or some 8 ^ 
such that 


lx ~ 7 c 
lx + 7 c 


-log (k n /n)/k n = 0(n s ). 


if 7 c > lx, 


(7) 


2.2 Asymptotic results 

Let us introduce the notation a n ^ '■= a{n/k n )/U{n/k n ) (see the previous paragraph 
for the definition of functions U and a), where a n ^ —> 0 (c/ equation (3.5.5) in 
Haan and Ferreira (2006)]). In the paper, Beta(-, •) denotes the usual Beta function, 
Beta(a, b ) = t a_1 (l — t) b ~ l dt (a > 0, b > 0). 

Theorem 1 Under assumption (A) and conditions Q and (K), for any given a ^ 1, 
both 7 n, A and , n, A converge in probability, as n tends to go, to 

\ a n,k) \ a n,k) 

\lx\~ 1 \l\~ a Beta(\ix\~ 1 ; a + 1). 

The following corollary states the consistency of our two different adaptations of 
the Moment estimator to this censored framework. 


Corollary 1 Under conditions of Theorem^ 7J as n —> go, 


In,Mom '■ M n kn + 1 o I f 


(<1) 2N 


<1 


lx 


and 


In,Mom +1 „ I 1 


1 

2 




M 


■( 2 ) 


n,k n 


lx- 
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In fact, by using the elementary properties of the Beta function, the weighted 
moments or can be combined in different ways, leading to the definition 
of two different classes of consistent estimators of j x , parametrized by a ^ 1 (proofs 
of the 3 corollaries are easy and omitted). In the next section, we study their finite 
sample performance. 


Corollary 2 Under conditions of Theorem [IJ as n —* oo, 

7 { n} ■= (KTa + « + 1)^ lx 

and 


where 


7ig (y n ,a + a + 1 

a + 2 (Mn“ +1) ) 2 


lx 


J r, , “ + 2 (* f ~“ +1> ) 2 

and V n a := 1-— -r. 

a + 1 Mi a) Mi a+2) 


Vn ' a ' 1 a + 1 Mi a) Mi a+2) 


Corollary 3 Under conditions of Theorem^ IJ as n -+ cc, 


and 


where 


In,2 ■ — 

1 (of + l)-^72,Q! 

p 

(a + 1)(1 — R n ,a) 

* lx 

~(a) 

In,2 ■= ' 

1 (of + \^R nOL 

p 

(a + 1)(1 — R n ,a) 

* lx, 


M ^ 

— r— and R„,, 

Mn ] Mn 


M, 


(a+l) 


M 


(a+l) 


Remark 1 It is straightforward to see that 7^ with a = 1 equals 1 — 1(1 — R n ,i) 1 , 
which is very close to in,Mam, since il^ 1 ^ —* 0 in our finite endpoint framework. 

Remark 2 If denotes the unweighted moments defined in the introduction, it 

can be proved that under (A) and Q, for a ^ 1, 


ifln,k 

Therefore, it is easy to check that combining those moments as described in Corollaries 
pi and fsl leads to consistent estimators of 7, and thus dividing the latter by p (defined 
in the introduction) leads to 2 classes of consistent estimators % f and of jx- We 
also define i n ,Mom as the estimator of ix obtained by dividing the classical Moment 
estimator of 7 by the proportion p. A finite-sample comparison of those estimators 
with our new competitors is presented in the following section. 


t — | T |- a “ 1 ^ eto (| 7 |- 1 5 « + !)■ 
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Remark 3 Note that the combination of moments proposed in Corollaries 2 and 
3 become inadequate in the framework of a positive extreme value index: it can be 
indeed proved that, in this framework, the combinations 7^“- and 7^“ ■, for j = 1 

or 2, converge in probability to zero, by proving that and converge in 

probability to 7£r(a + 1) (in the complete data case, this result is known for , 

see (Segers (2001)1). This could suggest, in the positive index case, the definition 
of estimators of 7^ which would be equal to 7^“- or 7^“- (for j = 1 or 2) plus a 
’’censored version” of the Hill estimator (in the same spirit as the definition of the 
Moment estimator, which equals the Hill estimator plus a term converging to 0 in the 
positive index case). 


3 Finite sample behavior 


The goal of this Section is to present onr results concerning the finite sample per¬ 
formances of our new estimators of the extreme value index in presence of random 
censorship, presented in Corollaries m and [3] In each case considered, 2000 random 
samples of size n = 500 were generated, and the median bias and mean squared error 
(MSE) of the different estimators of 7 were plotted against the number k n of excesses 
used. 

A great variety of situations can be (and has been) considered in our simulation 
study : various values of 7^ and 7 c (and therefore various censoring rates in the 
tail), various families of underlying distributions (Reverse Burr, generalized Pareto, 
Beta), and choice of the value of a. It is impossible to illustrate here the different 
possible combinations of these features : we will therefore try to draw some general 
conclusions from the many different situations we have observed, and provide a partial 
illustration with 3 particular cases. 

Concerning the choice of the tuning parameter a , we did not find a value which 
seemed preferable in every situation : nonetheless, in general, for small values of k n , a 
value of a around 1 or 2 yields better MSE, whereas for high values of k n , the MSE is 
lower for values of a greater than 2. We decided not to include this preliminary study 
in this article, and chose (almost arbitrarily) the value a = 2 in all our subsequent 
simulations. 


Let us now settle the vocabulary used in this section. We will call Moment es¬ 
timators the estimators 7 n ,Mom and 7 n ,Mom appearing in Corollary [lj as well as the 
estimator 7 n ,Mom introduced in Remark [2] above. We will call type 1 (resp. type 2) 


estimators the estimators 7^ and 7^ (resp. 7^ and 7,^ 


(resp. 


3), as well as the estimator 7^ (resp. jff ) introduced in Remark 


Oh 


n,2. 


appearing in Corollary 


We will also consider names for the different methods : the KM method (for 
Kaplan-Meier-like weights, appearing in the definition of ), leading to 7 es¬ 

timators, the L method (for Leurgans-like weights) leading to 7 estimators (the 
name comes from the mathematician Sue Leurgans who inspired the weights, see 
Worms (2014)| for details and a reference), and the EFG method (for constant 
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Figure 1: Comparison between 7/ [ (thick black), 7/ [ (dashed black), 7/ j (thin 
black), 7 n,Mom (thick grey), 7 n ,Mom (dashed grey) and *f n ,Mom (thin grey) for a 
RevBurr(l, 1,1,10) censored by a RevBurr(10, 2/3,1,10) (yx = —1 > 7c = —3/2, 
p=2/5, weak censoring) 


weighting by p), leading to 7 estimators (the names comes from the initials of the 
authors of |Einmahl et ah (2008)]). 

There are two main questions addressed in this empirical study : is one of the 
3 methods preferable to the others (and in which conditions) and is there a better 
choice for the type of estimator (type 1 , type 2, or classical Moment estimator) ? 
Unsurprisingly, after our intensive simulation study, we may say that the answer is 
no for the 2 questions, if an overall superiority is looked for. However, we can make 
some partial comments concerning the choice of the method and of the estimator 
type, whether the censoring is strong or weak, or the value of |y| is small or not. 

Note first that, if the censoring rate 1 — p in the tail is very low (say lower than 
10%), we observed that there was not much difference between the 3 methods (KM, 
L, EFG), and that it was just a question of choosing between type 1, type 2, and 
moment estimator. This is why, in the following, we only consider cases where the 
censoring rate 1 —p = ^ is larger than 1/4, and talk about strong censoring in the 
tail when this rate is greater than 1/2 (he. yx < yc) , and weak censoring otherwise 
(when yx > 7c) • 

For “high” values of yx, he. lower than —1/2, we have most of the time observed 
better performance of the KM and L methods with respect to the EFG method, 
in strong or weak censoring frameworks. In this context, the type 1 estimators are 
generally preferable to the type 2 estimators, and comparable or preferable to the 
moment estimator. 

For values of yx between —1/2 and 0 (sometimes called the “regular” case, and 
which is the most frequently encountered in practice), there exists a great variety 
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Figure 2: Comparison between 7/ 2 (thick black), 7/ 2 (dashed black), 7/ 2 (thin 
black), 7 n ,Mom (thick grey), % )M om (dashed grey) and 7 n , M * (thin grey) for a 
RevBurr(l, 8,1/2,10) censored by a RevBurr(10,4,1/2,10) (7* = —1/4 >7 c = 
—1/2, p=l/3, weak censoring) 

of situations. We observed that the moment estimators were generally better than 
the type 2 estimators, which were themselves generally better than the type 1 ones. 
Concerning the choice of the method, for the moment estimator, it seems difficult 
to suggest a particular one, between the KM, L, and EFG methods (even though in 
many cases, at least one among the KM and L methods was better than the EFG 
method). Concerning the inferiority of types 1 and 2 versus the moment estimator, it 
should be noted that it is mainly clue to the bias, which contributes the most to the 
MSE (in fact, we clearly noticed that the variances of the types 1 and 2, for a = 2, 
are almost always lower than the variance of the moment estimator). 

The 3 particular situations we chose as illustrations of the comments above involve 
the Reverse Burr class of distributions RevBurr(/3,r, \,x*) (with /3,r, A > 0) : its 
survival function is 

P(X > x) = (1 + P~\x* - x )~ T )- x , 
and its extreme value index is — l/(Ar). 

In Figure 1, the value of 7^ is lower than —1/2, and therefore, as motivated above, 
for readability purposes we only kept the type 1 estimators on the graph, whereas for 
the other two figures, the value of 7^ is between —1/2 and 0 and we therefore only 
kept the type 2 estimator illustrated. Remind here that these 3 examples are only 3 
particular cases of the numerous combinations of features we have considered in our 
simulation study. 
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Figure 3: Comparison between 7 / 2 (thick black), 77 2 (dashed black), 7 / 2 (thin 
black), 7 n,Mom (thick grey), 7 n ,Mom (dashed grey) and 7 n ,Mom (thin grey) for a 
RevBurr(10, 8,1/2,10) censored by a RevBurr(10, 5,1,10) (7* = —1/4 <7 c = —1/5, 
p=5/9, strong censoring) 


4 Conclusion 


In this paper, we applied the methodology introduced in |Worms (2014) to define 
weighted versions of the moments of relative excesses, and consequently construct new 
estimators of the extreme value index for randomly-censored data with distributions 
in the Weibull domain of attraction. We proposed, in particular, a new adaptation 
of the famous Moment estimator. Our intensive simulation study shows that the 
proposed estimators are competitive even if, in many cases, the bias would need to 
be reduced. A future possible work would be to exploit our weighting methodology 
in order to estimate other parameters of the tail (for reducing the bias, for example) 
as well as extreme quantiles. The asymptotic normality remains a question to be 
addressed (difficulties come from the control of the Kaplan-Meier estimates in the 
tail). 
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5 Proof of Theorem HI 


Before proceeding to the proof of Theorem [lj we state the following Proposition which 
explains the link between our two proposals of weighted moments. 

Proposition 1 W For any a Ss 1, » M { “^ + (1 - K,n)D { ^, where 


= 


7l(l — F n {Z n -k n ,n)){ 1 — G n (Z nn )) 


log° 


J n,n 


Z n — 


n—k n ,n 


(ii) Under the same assumptions as Theorem^ for any a ^ 1, we have D 


Op (a 


(a) 

n,k n 


n,k) 


According to this proposition, the validity of Theorem [lj for is a consequence 

of Theorem [l] for . Proposition [l] will be proved at the end of the appendix. 

We now need to state the following technical Lemmas, which will be proved in the 
appendix. The notation £ i>n was introduced in ([3]) and we note 


7 . ._ 

^2,72 • 


% ^n—i+l,r 


rp _ 7 

X ZJr.— 


n—k n ,n 


Lemma 1 Let a ^ 1 and Ui = Ui^ n = 


(i ) if 0 < a < 1 then 


. Under assumptions (A) arid (j^Jb 


k a a 

n i=1 a n,k 

(ii) if a > l, then for any S' > 0, 


—a f,i,n. P 


(1 — a.) I'y | " 1 Beta 


1 — a 


; a + 1 ) , 


1 


2 


-a fi,n P 


Ua+S 1 n a 

N. i=1 u n,k 


Lemma 2 For any given positive exponents 6 and O’ > 0, there exist constants c > 1, 
d < 1 both arbitrarily close to 1, and a + > 0, a_ > 0, arbitrarily close to 76 1 and to 
76*' respectively, such that 


lim P max 


co l iscfc n cu. a+ 


>l| = lim P min 

n—► go \ i^k n r J 


i,n 


<1=0. 


C U- 


simi- 


We now proceed to the proof of Theorem 1 (for ), which has structural 
larities with the proof of Theorem 2 in Worms (2014)]. We shall refer to the latter 
when necessary. We have the decomposition 


A ill - Mw n + K) . 
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where 


\ _ 1 G n (Zn — k n ,n) 

An ■“ 1 -G(Z n _ kntn ) 
i 


«-=iS£l (Ci.n-lW.r 


w ■= — y kn w 

vv n ■ kn Zji= 1 ” ®, rl > 

f~i . 1—G(Z K _,-fi ;n ) 

i-S n (z-_ 4+liB )’ 


and 


Tj, 1 G(Z n _k n ,n) c 

VV i,n ■— 1 nl V \ 

1 Lt yZjn—i+i^nj 


Since A n —> 1 as n —» oo (see Theorem 2 in jCsorgo (1996)]), we need to prove that 
R n = op(a“ fc ) and 

^ — *«, (8) 


*n,l- 


where l a denote the limit in the statement of Theorem [lj 


5.1 Proof of W n /a® k 


- lr 


Since G e D(H lc ) 1 with 7<7 < 0, is equivalent to t —*■ 1 — G(x* — t) being regularly 
varying at 0 with index — I/7C (see the appendix for the definition of regular variation 
at 0), the bounds (12) in Corollary [T] (in the appendix) applied to f = G, x = 
(x* - Z n _ i+ 1,„)/(£* - Z n _ kn:7l ) and t = x* - Z n _ kri . n , yield, for e > 0, n sufficiently 
large and every 1 7 i 7 k n , 


(1 - e) Z& S m,„ < (1 + e) Z7; 

Let ?] > 0. We first write, for e sufficiently small, 


WnK,k - i« >») < p (*£‘ Si:, zg “'ir - <«> I), 

1 Y^' 


n,k 


(9) 


P( /a - lPn/a“ fc > T? ) < P( L - k~ ^S > I )■ 

n,k 

Let us now consider constants c > 1 and d < 1, both arbitrary close to 1, and a + > 0 
and a_ > 0 both arbitrary close to 7/7 c ■ These constants come from the application 
of Lemma [2] above with 9 = 7^ — e and 6' = 7^ + e. Using positivity of £ in , it comes 


P 


W r . 


l n,k 


la > V 


7 P max 


zl° 


CU ■ a+ 


> 1 


HD I 1 n 

PI L -— > r] 


a. 


n,k 


+pi 


i= 1 

z 7 ^ 1+e 

7 P I min l,n _ n < 1 

c i u 


l, n,k 




+ P ( l a - c' k n 1 2 
2=1 


—a_ T 


in ^ 9 
a >2 


l n,k 
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where Ui = Ui }U = i/(k n + 1) for 1 < i < k n . If we call l a ^ a the limit in the statement 
of Lemma [I] ( i ), and if we apply Lemma [2] as indicated previously, we have 


lim sup P ( — l a > rj ) 

71 —>00 ' a n,k J 

lim sup P ( l a — ^r- > ^ ) 

n_.rn \ n,K / 


< 


< 


lim sup P 

n—>00 

lim sup P 

n—> oo 



Since l a = ^ a , 7 / 7c , and both a + and a_ are arbitrary close to 7/7 c < 1, it is easy to 
see that ([8]) comes from the application of Lemma [I] (i) to a = a + and a = a_. 


5.2 Proof of R n = op(a“ /t ) 

Let us use the same decomposition as in the proof of the negligibility of the term R n 
Worms (2014)| e subsection 5.1.2 there). In other words, we define, for some 


m 


5 ' > 0 , 


C(t) : = 


=-r 


dG(x) 


and h i)U . (C [Z n _ j+i,n)) 2 , 


Jo (1-G(x))=»(l-F(x)) 
and we readily have |i? n | < T r [ , where 


1 I" 71 . 1 

T„ := sup Rd\h{ n {Ci n l)| and :=* — J] W itn hRn ~^. 

-« "' /Crj 




2=1 


Using sharp results of the survival analysis literature, we have already proved in 
Worms (2014)] that T* = 0 P (1). It remains to prove that 


T nK,k = OpW- 

First, from the dehnition of /p n and C, since (1 — H) = (1 — F)(l — G ) we clearly 
have 


hZ l < 


— log(l — G(Z n _i + i n )) 


4+<5' 


1 — H(Z n _i + i, n ) 

Moreover, under assumption (A), 1 — H{x* — •) is regularly varying at zero with index 
—1/7 and — log(l-G(x* — •)) is slowly varying at 0 : therefore, the application of ([9]), 
as well as bound (11) to — log(l — G(x* — •)) and bound (12) to f = G and f = H, 
implies that , for n sufficiently larger, ^ 4 P n Q n where 


Pn ■ = n 2 


1 


log(l - <2(Zn-fc„,n)) 

1 - H(Z n _ kn , n ) 


+- 5 ' 


Qn ■■= 

i= 1 


where /3 = (2y) 1 + 7 C — e', for some e' > 0. 
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We thus need to prove that P n Q n / a“ k = op(l). 


Let 7 ] > 0 and consider constants c > 1 arbitrarily close to 1 and a + > 0 arbitrarily 


close to 7/3 = f + P - ye' > \ + P. We have, 


P 


PnQr 


l n,k 


> r) 1 ^ P max 


7 C 

7P 

J i,n 


% S~) k 'f i 


cu. 


-CL+ 


> 1 


+ P\PnK l 'Z 


U, 


-a+ 


i=l 


*n.k 


> r l 


( 10 ) 


First, Lemma [2] is applied with 9 = (3 and thus the first term of the right-hand 
side of (10) tends to 0. Next, Lemma [I] is applied with a = a + : we thus need to 
distinguish the case yc < y x (for which y/3 < 1) from the case y c 5 s y x (f° r which 
y/3 > 1 when e' > 0 gets small). 

(i) Case y c < y x 

First of all, assumption (K) implies that P„ =^Op(l) (see relation (20) in 

(n) implies that P uf a+ pfP 


(Worms (2014)]). Since a + < 1 in this case, Lemma 

Op(l) and consequently the second term of the right hand-side of (fTbj) tends to 
0. 


(ii) Case y c ^ y x 

In this case a + > 1, therefore Lemma [l] (ii) implies that, for any given S' > 0, 
kn tya++5 ^ Yh=i u 7 a+ Jw" = op(l). Moreover, assumption (K) implies that, for 
8 ' > 0 small enough and a + sufficiently close to 1/2+y/yc, we have k^ l ++s '^ 1 P n = 


Op(l). Hence, the second term of the right hand-side of (10) also tends to 0. o 


6 Appendix 


6.1 Regular variation and Potter-type bounds 

Definition 1 An ultimately positive function f : M + —>• M is regularly varying (at 
infinity) with index aeR, if 

lilt yw~ x ‘ {Vx > 0) ' 

This is noted f e RV a . If a = 0, / is said to be slowly varying. 


Remark 4 Regular variation (and slow variation) can be defined at zero as well. 
A function f is said to be regularly varying at zero with index a if the function 
x —> /( 1 /x) is regularly varying at infinity, with index —a. 


Proposition 2 (See /Haan and Ferreira (2006)] Proposition B.1.9) 

Suppose f e RV a . If x > 0 and 81,82 > 0 are given, then there exists to = ^( 81 , 82 ) 
such that for any t ^ t 0 satisfying tx ^ to, we have 


(1 — <5i)x Q min(a : <52 ,x 52 ) < 


f(tx) 

m 


< (1 + 8 i)x a max(i & , x S2 ). 
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If x < 1 and e > 0, then there exists t 0 = t 0 (e) such that for every t ^ to 


(1 - e)x a+£ < < (1 + e)x a ~ £ 

fit) 

and if x ^ 1 , 

(1 - e)x a ~ £ < 4^4 < (1 + e)x a+£ . 
fit) 


Corollary 4 If f is a positive function with end-point x*, such that t —» 1 — f(x* — t) 
is regularly varying at 0 with index a, i.e. 


1 — ffx* — tx) 
1 - fix* - t) 


x a , as t —> 0, 


for some a e then for every e 
VO < x < 1, 

(1 - e)x a+£ ^ 


> 0, there exists t 0 > 0 such that, VO < t < to 
1 — fix* — tx) 


1 - fix* - t) 


< (1 + e)x c 


( 11 ) 


and 


(1 - e)x~ a+£ ^ 


fjx*-t) 
1 — fix* — tx) 


< (1 + e)x~°~ £ 


( 12 ) 


Below, U corresponds to the quantile function associated to H introduced in para¬ 
graph 2.1. 


Corollary 5 If U satisfies condition (§. then for every e > 0, there exists t 0 > 0 
such that, VO < t < t 0 , Vx ^ 1, 


(1 - e)x 7 ~ £ ^ 


U (oo) - U(tx) 
U (oo) - U(t) 


< (1 + e)x 7+£ . 


(13) 


Proposition 3 (see (Haan and Ferreira (2006)1 Theorem B.2.18) 

If U satisfies condition (|5j) with the positive function a, then there exists a function 
q 0 equivalent to a/U at infinity such that Ve > 0, 3 1 0 > 0, Vt ^ t 0 , Vx ^ 1, 



7+e ^ log U(tx) - log Uit) 

ex ' V -^- 

Qo(t) 


x 7 - 1 

<c - 

7 


+ ex 


7+e 


(14) 


We now proceed to the proofs of the different lemmas stated previously and finally 
of Proposition [T| . 
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6.2 Proof of Lemma |T| 

Let c n := ( qo(n/k n )/a nt k) a (which tends to 1 as n —* go) and (Y t ) be a sequence of 
i.i.d standard Pareto random variables. Let 


LLi k ■ — 


log (U(Y n _ i+1 , n )) - log (U(Y n _ kn , n )) 

QO 0- n—k n ,n) 


and QQ i}k 


n — i-f l,n 
^ n — k n ,n 


7 

- 1 


7 


For every 1 ^ i ^ k n , we thus have 


where 


= i((LL itk ) a -(LL i+1}k ) a ) 


oii, n ■■= i (( QQi, k ) a - {QQi+i,k) a ) 

Pi,n • ^(-®fc,n(^) -®fc,n(^ 7 1)) 

B k , n (i) := (LL itk ) a - {QQi, k ) a , 


with B k , n [k n + 1) = 0. 

Our hrst step will be to prove that (with 5' > 0) 


kr 


1] U i a Pi,n or 7^2 tq a /3 itn is op(l) whether 0<a<lora>l. 


i= 1 


La+5' 

K n i= i 


Using bounds ([14]) for some e' > 0, with t = Y n _ kri . n and x = Y n _ i+hn /Y n _ kri:n > 1, 
and relying on the mean value theorem, we easily prove that, since a ^ 1 and 7 < 0, 


\{LL hk ) a - (QQi,kT\ ^ ce' (^P±1y\ 

V r n —k n ,n J 


7+e' 


^ ce' 


(15) 


for some constant c (close to alyp 
1 < i ^ k v . Since I B kn (i)\ = 


k,n 

k 


\ ''kfi @j,n 

^ j=i ~j~ 


Therefore \B k , n (i)\ = Op(l), uniformly on 


1— 'W n ,, a fj. 

+ 1 Z_ii=l u i ri,r 


k n ~\~ 1 


, we thus have, when 0 < a < 1, 

kn — CL 

- U i < 

kn Pi,n C u i —a 


IZi ^-fn]- a ds 


= (1^) 

\ u ’ s~ a ds 






< op(i)isS:i«r a 

= Op(l). 

The proof for a > 1 is similar (see end of subsection 5.2.1 in | Worms (2014) for more 
details, with the difference that now \B kt n(i)\ = Op(l) uniformly in i). 
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Since we have dealt with the j3^ n part, the lemma will be proved as soon as we obtain 
that, when 0 < a < 1 , 


k, 


V j i C^i,n 


(1-a) 


i—a —1 


Beta 


2=1 


1 — a 


; a + 1 


(16) 


1 


and, when a > 1 , 

^7 2 ,n — 0. (17) 

71 2=1 

From now on we will sometimes write k instead of k n . Let ( Ei ) be a sequence of 


i.i.d standard exponential random variables. According to (23), and by applying the 
mean value theorem, there exist some random variables E* k e [E k _ itk , E k _ i+ i }k ] such 
that (remind below that 7 is < 0 and a ^ 1 ) 


&i,n ^ 


o^^k— i+l,fc^ 


g'yEk—i,k _ 1 


Q( % ^ .tls -t—^k — 2 , k ) ^ 


7 Ef k ( l-e 7 V- 
l7l 


a—1 


= a\ 


11 —a 


X * (E k _ i+1 , k - E k _ itk ) { «[ 7| ( 1 - 4 71 ) 0 - 1 + A i)n } (18) 


where 


A ? - n := e 7E **(l - e^y- 1 u^\ 1 - uj 71 ) 01 - 1 . 

We will prove later that 

max |A i>ri | = op(l). 

i^k n 


(19) 


For the moment, note that (i (E k _ i+1>k - E k _ ijk )) i<kn = (fi)i^ kn due to the Renyi 
representation, where (/)) denotes a sequence of i.i.d standard exponential random 
variables. Moreover, application of the law of large numbers for triangular arrays of 
independent random variables (cf [Chow and Teicher (1997)j ; details are omitted) 
implies that, when 0 < a < 1 , 

1 kn -1 kn pi 

-r Y. uk a h = Op(l) and — V u^(l - u^ 1 ^ x^~ a (1 - x^) a ~i dx 

Kn i= 1 7 = 1 J 0 

( 20 ) 

and, when a > 1 (and 5' > 0 is given), 


k , ci~\~5 r 

A/ • 1 

71 2=1 


“/> = Op(l). 


( 21 ) 


Considering first the situation 0 < a < 1, combining (18), (19) and (20) shows that 


relation (16) will hold as soon as 


°(1 —x^)" 1 dx = a 1 (1 — a) |y| 2 Beta 


1 — a 


; « + 1 1 . 
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Use of the formulas Beta(u, v ) = r(u)r(u)/r(u + v) and uT(u) = T(u + 1) proves the 


latter relation. When a > 1, relation (17) is a consequence of (Q 8 b, (19) and (21) . 


It remains to prove relation (|19j) . For this purpose, we introduce the sequence 
(Vi) of i.i.d. standard uniform random variables such that V),fc = e ~ Ek ~ i+1 ’ k , and we 


note V* k := e Ei ’ k . 


If we set Wi : = vjl\ W* 


:= (V* k )^ and Vi := u \ 7 ', then relation 


(19) is now 


M n : = max 
1 


w*u-w; 




Vi( 1 


\a —11 


— Op(l), 


with 0 < Wi ^ W* ^ W l+ \ < 1. Unfortunately, the function x >-> x(l — x) a ~ l is not 
uniformly continuous on ]0,1[ if a is smaller than 2. Until the end of the proof we 
will note max* instead of maxi^,^. Since \x — a\ ^ max{|?/ — a\, \z — a|} whenever 
\x — y\ < \z — y | (this yields the first inequality below), we have 

M n max { max, |Wj(l - W i+ 1 )" _1 - u,(l - v^*- 1 ] 5 

max, |W i+ i(l - Wi)*- 1 - Vi( 1 - Vi )“" 1 | } 

^ max { max, |W*(1 - W i+ 1 )"" 1 - W,(l - W^ 1 ] ; 

max, \(W i+ 1 - Wi)( 1 - Wi)*- 1 ] ; max* |W(1 - W,)*- 1 - Vi (l - 1 } 

< max { max.j | (1 - W i+ i) Q_1 - (1 - fU) Q_1 | ; max, \W i+ i - Wi\ ; 

max, |Wi - Vi\ ; max* |(1 - W)" -1 - (1 - Vi) a ~ l \ } 

= : max { M nA ; M n>2 ; A4,3 ; M„ j4 } . 

Now, since |x“ — y“| ^ (a a l)|x — |/| avl whenever x and y belong to [0,1] and a > 0, 
we have M nx < cM£ 2 and M n 4 ^ cM 3 for some positive constants c and c' . On the 
other hand, M n2 is bounded by 2M„ ]3 + max, |u i+1 — u,| = 2M n3 + o(l). Therefore, 
the negligibility of M n amounts to 


M n 3 = max 


1 fcri 


u 


bl Jvl 


M- 


= op(l). 


This property is proved in details in |Beirlant et al. (2002) | (page 164, with — p instead 
of I'y|), so we do not reproduce it here. 


o 


6.3 Proof of Lemma [2] 


If Yi,n, ■ ■ ■, Y n , n denote the ascending order statistics of n i.i.d standard Pareto random 
variables, we have 


~ d U(oo) U(Y n _ i+ i, n ) 

£7(oo) - U(Y n _ kn , n ) ' 


Applying bounds (13), it comes, for some given e' > 0 and n sufficiently large, 


/ £7(oo)-£7(y w _ <+1 , n ) y < 
v £7(00) - E7(W-fc n ,n) ) 
f U(oo)-U(Y n _ i+ i, w ) \ y > 
^ ^ (®) ^ U(Y n _ kn , n ) ) 


(l + e') d ( Y ^Y + 
\ Y n —k„,n ) 

(X — e) e ' ( ^ n ~ i + 1 ’ n | 

V Y n _ kn U J 
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where 


a + = (7 + e')9 and a_ = (7 — e')9'. 
We finish the proof as for Lemma 1 of |Worms (2014)]. 


o 


6.4 Proof of Proposition [T] 


(i) First, note that for any sequences (a n ) and ( b n ) such that a 0 = 0 and b k+1 = 0, 
we have XiLi a i( b i - b i+ 1) = ZLiK - a i-i) b i- By letting a { = * L -r and 

1 ^n^ n _ i+ ljTl ) 

— inn- Q ( Z n-x+i,n \ we } iave ^ fc+1 _ q j )U t aQ i s undefined, so we set ao = 0. Therefore, 

Zin .— k.n “ ' 


hi = log“ (- 
by the definition (|3|) of £j :n , this implies that 


\ _ 

2-ii =1 1 - 




Gn (^n—i+l,n) 


= IZ 2 FT 


2 — 1 


dn(-^ n _ i + 1 „) 1 G' n (Z n _ i+2 ) 


+ 


log 0 


^n—i 


+ l,n 


1 — Gn(Z n , n ) 


log 1 


Oi I ^n,n 

^n — k n ,n 


By definition of M^ n and , part (i) of Proposition [l] will be proved as soon as 
we show that, for 2 ^ i ^ k n , 


i — 1 


hn—i+l,r 


1 - G n (Z n _ i+ln ) 1 - G n {Z n _ i+2 n ) 1 - G n (Z n _ i+l n ) 


Note that 


2—1 


1 G n ( Z n _ i+lrl ) 1 G n ( Z rl _ i+2n ) 

1 In (a 1 \ ^ n V^n — 2 + 1, 


1 G „( Z n _ i+ln ) 


1 G "( z n -i+i,J 

1 ' 1-<5»(Z^«,») 


( 22 ) 


where 


1 - G n (Z n _ i+l n ) 


n n-i ( n~j 
3 = 1 U-J+1 


1 — So 


1 - G n (Z n _ i+2 ) n n T i+1 ( n ~ j \ 

1 B=i yn—j+i y 


1 — 


The right-hand side is clearly eq 


fi¬ 


lial to 4-y 1+1,71 , which may be rewritten as 


. Replacing in (22) concludes the proof of (i). 


(ii) Note first that 


D { ± =T-A n C, n W, n , 


n,k n 7 

n>n 

where A n and C ltU are defined in the beginning of the proof of Theorem [l] and 

w lin = 1 7 g l z i fcn ; n) iog° 


1 — G(Z njn ) 


J n—kn,n 
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We know that A n = op(l). Since C\^ n = Op(l) (see Theorem 2.2 in Zhou (1991)] ), 
we only have to prove that -r-^sr 21 = op(l). For the same reasons that led to (9), for 

Kn a n,k 

any e > 0 and n sufficiently large, we get W\ , n ^ W^ n with 


= (1 + e) log 0 


Z[ 


7 c 1 -e 


. Zn—k n ,n, 

If (Yi)i^ n is an i.i.d. sequence of standard Pareto random variables, then 


— 1 1,5 


-n a 
u n,k 


= (1 + e)(LL ltk ) a (U 1)k yo-*, 


where c n —*1 and LL\ k were defined in the proof of Lemma 1 and U\ k = \ ■ 

’ UyJJ) ^ ^ n — k n ,n) 

On one hand, Potter bounds (13) yield for any e' > 0 and n sufficiently large, 


{U^yc (1 + e') 


Y n 


Yn — 


n—kn,n 


with a + = (7 + e / )(7c — e )- On th e other hand, (15) implies that for e" > 0, 
\(LLi t k) a — {QQi,k) a \ 7) ce", for some constant c, where QQ^k is also defined in the 
proof of Lemma [lj Moreover, it is known that 


n—i+l,n/Y n —k n ,n) 0^k n —i+l,k n )l 


(23) 


where Y l kn , ■ ■ ■, Y knkn are the ascending order statistics of k n i.i.d random variables 
Yi,..., Y kn with standard Pareto distribution. 

Consequently, (LL^ k ) a {U l)k yc- e <: Q l n , with 


Ql,n — (1 + C 7 ) 



(Xk n ,k n ) a + ce"(l + e')(Y fenjfc J 


Since Y kn kn ^ 1 and 7 < 0, the right-hand side of the relation above is lower than 
d(Y kri ,k n ) a+ , f° r some constant d. 

Now, Standard Pareto distributions having moments of order less than 1, 

Yk n ,k n = max Y i>kn = o P (k p ), 

1 kf'i 


for any p > 1, therefore, ^-( Y kni k n ) a = Op(k pa+ x ), which is clearly op(l). This 
concludes the proof. o 
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